Speech Processing: A Dynamic and Optimization-oriented Approach
Autor Li Deng, Douglas O'Shaughnessy
Publicado por CRC Press, 2003
ISBN 0824740408, 9780824740405
752 páginas

pág 50

CEPSTRUM
la transformada inversa del logaritmo del producto de señales.

Cepstral ANALYSIS OF SPEECH

La voz humana es la convolución de una señal de excitación con la respuesta del tracto vocal. El análisis de voz, en muchos casos, consiste en estimar los parámetros de este modelo de producción básico.
Es necesario realizar una deconvolución, siendo la producción de dos señales a partir de una, un proceso no determinístico. sin embargo, en el caso de la voz se obtiene cierto éxito debido a que las componentes principales tienen un comportamiento tiempo-frecuencia muy diferente.

La deconvolución espectral convierte el producto de dos espectros en la suma de dos señales. Si estas señales son lo suficientemente distintas, pueden filtrarse linealmente.
La componente del tracto vocal tiene curvas formantes suaves (su espectro varía poco en frecuencia), mientras que la componente de excitación es mucho más irregular (debido a las armónicas o al ruido de excitación?), es por estas características que pueden ser separadas linealmente.

-------------------------------------------------------------------------------------------------------------------------------

Recent advancements in communication systems performance have been only possible because of digital signal processing applied in all areas of communication systems development and implementation. Advanced Signal Processing for Communication Systems consists of 20 contributions from researchers and experts. The first group of chapters deals with the audio and video processing for communications applications, including topics ranging from multimedia content delivery over the Internet, through the speech processing and recognition to recognition of non-speech sounds that can be attributed to the surrounding environment. Significant attention is given to orthogonal frequency division multiplexing (OFDM) in its various forms, e.g. HIPERLAN, IEEE 802, 11 a. Aspects of OFDM technology covered include novel forms of modulation and coding, methods of reducing in-band and out-of-band spurious signal generation, and means of reducing the peak-to-average power ratio of an OFDM waveform. In these contributions, a key objective is to return the inherent implementational simplicity of the OFDM technique while enhancing its performance relative to single carrier systems. Digital signal processing for second and third generation systems is represented in the book as well. The topics cover both theoretical issues like spreading sequence design and implementation issues of 3G user equipment modem, and MMSE receivers for CDMA systems. A useful comparison of complexity of channel estimation, equalization and decoding for GSM receivers is discussed, too. The book also includes sections on applications of error control coding, information theory, and digital signal processing for communication systems like modulation, software-defined radio, and channel estimation. Advanced Signal Processing for Communication Systems is written for researchers working on communication systems and signal processing, as well as telecommunications industry professionals.

Más información
Advanced Signal Processing for Communication Systems
Autor Tadeusz Wysocki, Mike Darnell, Bahram Honary
Publicado por Springer, 2002
ISBN 1402072023, 9781402072024
320 páginas





Science fiction has long been populated with conversational computers and robots. Now, speech synthesis and recognition have matured to the point where a wide range of real-world applications are within our grasp. This book takes the first interdisciplinary look at what we know about voice processing, where our technologies stand, and what the future may hold for this fascinating field.

Más información
Voice Communication Between Humans and Machines
Autor David B. Roe, Jay G. Wilpon, National Academy of Sciences (U.S.)
Colaborador David B. Roe, Jay G. Wilpon
Publicado por National Academies Press, 1994
ISBN 0309049881, 9780309049887
548 páginas

Diccionario de Informática, Telecomunicaciones y Ciencias Afines: Inglés-español= Dictionary Of Computing, Telecommunications, and Related Sciences: Spanish-english
Autor Mario León
Publicado por Ediciones Díaz de Santos, 2004
ISBN 8479786264, 9788479786267
1348 páginas


Pattern Recognition Technologies and Applications: Recent Advances
Autor Brijesh Verma, Michael Blumenstein
Publicado por Idea Group Inc (IGI), 2008
ISBN 1599048078, 9781599048079
435 páginas

Spoken Language Processing: A Guide to Theory, Algorithm, and System Development
Autor Xuedong Huang, Alejandro Acero, Alex Acero, Hsiao-Wuen Hon
Publicado por Prentice Hall PTR, 2001
Procedente de la Universidad de Michigan
Digitalizado el 21 Nov 2007
ISBN 0130226165, 9780130226167
980 páginas

El propsito de este libro es introducir al lector, de una manera clara y concisa, pero a la vez rigurosa, en un mundo del procesado de seales sonoras, haciendo especial nfasis en el procesado digital de la seal de voz. Se trata de una obra didctica que trata de cubrir todos los contenidos que debe de aprender un ingeniero en su formacin como tcnico especializado en procesado digital de sonido, tanto desde un punto de vista terico como prctico. La obra se estructura en nueve captulos donde se tratan desde la produccin y percepcin de sonido, anlisis de la seal de sonora, codificacin de las seales de voz y audio, efectos y tratamiento de audio, hasta una completa descripcin de los sistemas de conversin texto-voz y los sistemas de reconocimiento de voz y locutores. Este libro cuenta con CD complementario disponible en www.lulu.com que contiene los cdigos abiertos de numerosos programas realizados en MATLAB para la resolucin de los ejercicios incluidos en el libro.

Más información
Procesado digital de la Seal Sonora utilizando Matlab
Autor Antonio Pereira Rama
Publicado por Lulu.com, 2007
ISBN 1847991122, 9781847991126
504 páginas

Over the last 20 years, approaches to designing speech and language processing algorithms have moved from methods based on linguistics and speech science to data-driven pattern recognition techniques. These techniques have been the focus of intense, fast-moving research and have contributed to significant advances in this field. Pattern Recognition in Speech and Language Processing offers a systematic, up-to-date presentation of these recent developments. It begins with the fundamentals and recent theoretical advances in pattern recognition, with emphasis on classifier design criteria and optimization procedures. The focus then shifts to the application of these techniques to speech processing, with chapters exploring advances in applying pattern recognition to real speech and audio processing systems. The final section of the book examines topics related to pattern recognition in language processing: topics that represent promising new trends with direct impact on information processing systems for the Web, broadcast news, and other content-rich information resources. Each self-contained chapter includes figures, tables, diagrams, and references. The collective effort of experts at the forefront of the field, Pattern Recognition in Speech and Language Processing offers in-depth, insightful discussions on new developments and contains a wealth of information integral to the further development of human-machine communications.

Más información
Pattern Recognition in Speech and Language Processing
Autor Wu Chou, Biing-Hwang Juang
Colaborador Wu Chou, Biing-Hwang Juang
Publicado por CRC Press, 2003
ISBN 0849312329, 9780849312328
394 páginas

Paper
Hasan Rashidul, Jamil Mustafa, Rabbani Golam y Rahman Saifur. Speaker Identification Using MFCC. Bangladesh University of Engineering and Technology. 2004.

Molau Sirko, Pitz Michael, Schlüter Ralf y Ney Hermann. Computing MFCC on te power spectrum. University of Technology. Aachen, Germany. 

